ABSTRACT

The problem of quantile selection for the asymptotically best
linear unbiased estimators of location and scale parameters is
considered. The asymptotic properties of several quantile selection
methods for simultaneous parameter estimation are derived
and simple approximate solutions are provided. A robust scheme
for quantile selection is also developed.


1. INTRODUCTION

Assume that a random sample, $X_1,\ldots,X_n$, has been obtained
from a distribution of the form $F(x) = F_0((x-\mu)/\sigma)$, where
$F_0$ is a known distributional form and $\mu$ and $\sigma$ are,
respectively, location and scale parameters. This note is concerned
with the estimation of $\mu$ and $\sigma$ by the asymptotically best
linear unbiased estimators (ABLUE's) based on $k < n$ sample quantiles.


Define the sample quantile function by

    $Q(u) = X_{(j)}, \quad (j-1)/n < u \le j/n, \quad j = 1,\ldots,n,$    (1.1)

where $X_{(j)}$ is the jth sample order statistic; then, given a
spacing $T = \{u_1,\ldots,u_k\}$ (k real numbers satisfying
$0 < u_1 < \cdots < u_k < 1$) the ABLUE's are easily computed linear
functions of the $Q(u_i)$, $i = 1,\ldots,k$. Explicit estimator
formulae as well as expressions for the asymptotic efficiency of the
ABLUE's relative to the Cramér-Rao lower variance bounds can be found,
for example, in Chapter 5 of Sarhan and Greenberg (1962), in Cheng (1975)
or Eubank (1981a). Consequently, they will not be repeated here. As these
formulae all involve the spacing, T, the problem we address is the
selection of spacings that have optimal properties for certain functions
of the estimators' asymptotic relative efficiencies (ARE's).
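The sample quantile function can be illustrated with a short sketch. It assumes the common step-function convention $Q(u) = X_{(j)}$ for $(j-1)/n < u \le j/n$; the function name `sample_quantile` is hypothetical and is introduced only for this illustration.

```python
import math


def sample_quantile(sample, u):
    """Step-function sample quantile: the j-th order statistic,
    where j is the smallest integer with u <= j/n (assumed convention)."""
    if not 0.0 < u <= 1.0:
        raise ValueError("u must lie in (0, 1]")
    ordered = sorted(sample)          # order statistics X_(1) <= ... <= X_(n)
    n = len(ordered)
    j = math.ceil(n * u)              # index of the order statistic
    return ordered[j - 1]


def quantiles_at_spacing(sample, spacing):
    """Evaluate the sample quantile function at each point of a spacing T."""
    return [sample_quantile(sample, u) for u in spacing]
```

For a spacing $T = \{u_1,\ldots,u_k\}$, `quantiles_at_spacing` returns the k sample quantiles of which the ABLUE's are linear functions; the coefficients of those linear functions are not reproduced here.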

Before proceeding further it should be noted that spacing
selection for the ABLUE is related to several other problems
including those which derive from i) regression design for time
series with Brownian bridge error processes, ii) variable breakpoint
$L^2[0,1]$ piecewise constant approximation, iii) grouping
selection for the asymptotically most powerful group rank test
for two sample location and scale problems and iv) problems of
optimal stratification and grouping (see Gastwirth (1966),
Adatia and Chan (1981) and Eubank (1982) for discussions of some
of the relationships between these problems). Consequently,
the results presented here have applications in these areas
as well. Of particular importance for this article is the connection
between spacing selection and problems i) and ii) which
is used, implicitly, in subsequent sections. For more detailed
discussions of this relationship and further background material
on the ABLUE see Eubank (1981a,b) and Eubank, Smith and Smith (1981).

Let $D_k$ represent the set of all k-element spacings and,
for $T \in D_k$, denote the ABLUE's and their corresponding
variance-covariance matrix by $(\mu(T), \sigma(T))^t$ and
$(\sigma^2/n) A(T)^{-1}$, respectively. The joint ARE of the ABLUE's
is then given by

    $\mathrm{ARE}(\mu(T), \sigma(T)) = |A(T)| / |A|$    (1.2)

where A is the usual intrinsic accuracy matrix. The construction
of spacings which maximize (1.2) is both mathematically and numerically
intractable for most distributions. This has led to
consideration of other optimality criteria. For example,
Hassanein (1969a, 1969b) and Eisenberger and Posner (1965) choose
spacings that minimize the sum of the (asymptotic) estimators'
variances, which is equivalent to minimizing the trace of $A(T)^{-1}$,
denoted $\mathrm{tr}\,A(T)^{-1}$. Another alternative, proposed by
Hassanein (1977), is maximization of the sum of the estimators' ARE's or,
equivalently, maximizing $\mathrm{tr}\,A(T)B^{-1}$ where B is a diagonal
matrix consisting of the diagonal elements of A. In Section 2 the
asymptotic (as $k \to \infty$) properties of these alternative spacing
selection schemes are derived and simple approximate solutions are
provided. In each case, the solution is in the form of a density
function, h, on [0,1]. The sequence of spacings $\{T_k\}$, $T_k \in D_k$,
whose kth element consists of the (k+1)-tiles of h is called the
regular sequence generated by h, denoted RS(h), and for k sufficiently
large and optimal h, $T_k$ is the proposed approximate solution.
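The construction of a regular sequence can be sketched numerically: the kth spacing consists of the points $u_j$ at which the distribution function of h equals $j/(k+1)$, $j = 1,\ldots,k$. The following is a minimal sketch that assumes h can be evaluated on a fine grid and need not be normalized; the function name `regular_sequence` and the grid size `m` are illustrative choices, not part of the original development.

```python
import numpy as np


def regular_sequence(h, k, m=100_000):
    """Return the k-point spacing formed by the (k+1)-tiles of the density h.

    h : vectorized function giving a non-negative density on (0, 1),
        possibly unnormalized (it is normalized numerically here).
    """
    u = (np.arange(m) + 0.5) / m               # cell midpoints of a uniform grid
    w = h(u)
    # Approximate distribution function of h at each midpoint (midpoint rule).
    cdf = (np.cumsum(w) - 0.5 * w) / np.sum(w)
    targets = np.arange(1, k + 1) / (k + 1)    # j/(k+1), j = 1..k
    return np.interp(targets, cdf, u)          # invert the CDF by interpolation
```

For the uniform density this returns the equally spaced points $j/(k+1)$; for $h(u) = 2u$ it returns $\sqrt{j/(k+1)}$, which can serve as a quick check of the numerical inversion.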

In Section 3 robust spacing selection is considered. The
problem, in this case, is to select a (spacing) density that is
optimal relative to a known finite set of probability laws. The
resulting solution provides an asymptotic analog of a procedure
suggested by Chan and Rhodin (1980).

2. ASYMPTOTICALLY OPTIMAL SPACINGS

Denote the quantile function corresponding to $F_0$ by $Q_0$.
Assume that $F_0$ admits a density, $f_0$, and, hence, a density-quantile
function $d_0(u) = f_0(Q_0(u))$, $0 \le u \le 1$. Throughout this
section we require that both $d_0$ and the product $d_0 \cdot Q_0$ be
twice continuously differentiable on [0,1] and vanish at the ends of
the interval. We also adopt the notation

    $\psi(u) = (d_0''(u), (d_0 \cdot Q_0)''(u))^t.$    (2.1)

Using $\eta$ to denote any one of the criteria considered in
Section 1, i.e., $|A(T)|$, $\mathrm{tr}\,A(T)^{-1}$ or
$\mathrm{tr}\,A(T)B^{-1}$, we now define
and illustrate our concepts of asymptotic optimality for spacing
sequences. A more detailed development of these topics and other
results in this section can be found in Eubank (1981b). A sequence
of spacings $\{T_k\}$ is called asymptotically $\eta_1$-optimal if

    $\lim_{k\to\infty} [\eta(A) - \sup_{T \in D_k} \eta(A(T))]\,[\eta(A) - \eta(A(T_k))]^{-1} = 1$    (2.2)

and asymptotically $\eta_2$-optimal if

    $\lim_{k\to\infty} [\inf_{T \in D_k} \eta(A(T)^{-1}) - \eta(A^{-1})]\,[\eta(A(T_k)^{-1}) - \eta(A^{-1})]^{-1} = 1.$    (2.3)

In the case of the determinant criterion it was shown in Eubank
(1981a) that a sequence of optimal spacings for (1.2) satisfies

    $\lim_{k\to\infty} k^2 [\,|A| - \sup_{T \in D_k} |A(T)|\,] = \frac{|A|}{12} \{\int_0^1 [\psi(u)^t A^{-1} \psi(u)]^{1/3} du\}^3 \equiv \lambda_D/12$    (2.4)

and that the RS generated by
$h_D(u) = [\psi(u)^t A^{-1} \psi(u)]^{1/3} / \int_0^1 [\psi(s)^t A^{-1} \psi(s)]^{1/3} ds$
is asymptotically $\eta_1$-optimal. Thus (2.2) has the interpretation that,
for $\{T_k\}$ the RS generated by $h_D$, $|A| - |A(T_k)|$ converges at the
same rate (namely $O(k^{-2})$) with the same asymptotic constant
($\lambda_D/12$) as $|A| - \sup_{T \in D_k} |A(T)|$.
Similar interpretations hold for the other cases that are considered.
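The cube-of-an-integral form of the asymptotic constants, and the cube-root form of the optimal densities, can be motivated by a short calculation. The sketch below assumes that, for a regular sequence generated by h, the leading error term is proportional to $\int_0^1 g(u) h(u)^{-2} du$ for some non-negative kernel g (for instance $g = \psi^t A^{-1} \psi$); the symbols g and C(h) are introduced here only for illustration.

```latex
% Minimize C(h) = \int_0^1 g(u) h(u)^{-2} du over densities h on [0,1].
\begin{align*}
\int_0^1 g^{1/3}\,du
  &= \int_0^1 \bigl(g\,h^{-2}\bigr)^{1/3} h^{2/3}\,du \\
  &\le \Bigl(\int_0^1 g\,h^{-2}\,du\Bigr)^{1/3}
       \Bigl(\int_0^1 h\,du\Bigr)^{2/3}
   = C(h)^{1/3},
\end{align*}
% by H\"older's inequality with exponents 3 and 3/2, using \int_0^1 h = 1.
% Hence
\[
C(h) \;\ge\; \Bigl(\int_0^1 g^{1/3}\,du\Bigr)^{3},
\]
% with equality when h = g^{1/3} \big/ \int_0^1 g^{1/3}\,ds.
```

Under this assumption the best attainable constant is the cube of $\int_0^1 g^{1/3}$, and it is attained by the density proportional to $g^{1/3}$.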

If $\eta$ is now taken as the sum of ARE's it follows by arguments
similar to those in Eubank (1981a) and Theorem 4.1 of Sacks and
Ylvisaker (1968) that a sequence of spacings obtained by maximizing
$\mathrm{tr}\,A(T)B^{-1}$ satisfies

    $\lim_{k\to\infty} k^2 [\mathrm{tr}\,AB^{-1} - \sup_{T \in D_k} \mathrm{tr}\,A(T)B^{-1}] = \frac{1}{12} \{\int_0^1 [\psi(u)^t B^{-1} \psi(u)]^{1/3} du\}^3 \equiv \lambda_B/12.$    (2.5)

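An approximate (asymptotic) solution is provided by the RS obtained
from $h_B(u) = [\psi(u)^t B^{-1} \psi(u)]^{1/3} / \int_0^1 [\psi(s)^t B^{-1} \psi(s)]^{1/3} ds$,
which is asymptotically $\eta_1$-optimal. For spacings chosen to minimize
the sum of the estimators' variances we have (see Theorem 4.5 of Sacks and
Ylvisaker (1968) and the subsequent remark)

    $\lim_{k\to\infty} k^2 [\inf_{T \in D_k} \mathrm{tr}\,A(T)^{-1} - \mathrm{tr}\,A^{-1}] = \frac{1}{12} \{\int_0^1 [\psi(u)^t A^{-2} \psi(u)]^{1/3} du\}^3 \equiv \lambda_V/12.$    (2.6)

An asymptotically $\eta_2$-optimal sequence for this criterion is provided
by the RS generated by
$h_V(u) = [\psi(u)^t A^{-2} \psi(u)]^{1/3} / \int_0^1 [\psi(s)^t A^{-2} \psi(s)]^{1/3} ds$.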

The diagonal nature of B has the consequence that $h_B$ will
be easier to use, in general, than $h_D$ or $h_V$ for spacing computation.
Moreover, we see from (2.4) and (2.5) that maximizing
$|A(T)|$ and $\mathrm{tr}\,A(T)B^{-1}$ are asymptotically equivalent
procedures for symmetric distributions as, in this case, B = A (this
explains the similarity between these two solutions observed by
Kulldorff (1963) for the normal distribution). The same cannot be said
for spacings which minimize the sum of the variances.

The computation of asymptotically optimal spacings from $h_D$,
$h_B$, and $h_V$ will usually require the use of numerical methods
(cf. Eubank (1981a)). A distribution admitting a closed form solution
is the Cauchy, where asymptotically optimal spacings are
provided by uniformly spaced points (i.e., $T_k = \{j/(k+1);\ j = 1,\ldots,k\}$)
in all three cases.
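The Cauchy claim can be checked directly: if the quadratic forms in $\psi$ are constant in u, every one of the three densities is uniform. The sketch below assumes the standard Cauchy law, for which $d_0(u) = \sin^2(\pi u)/\pi$ and $d_0(u) Q_0(u) = -\sin(2\pi u)/(2\pi)$, and takes A = diag(1/2, 1/2) as the information matrix for unit scale; the helper names are hypothetical.

```python
import numpy as np


def cauchy_psi(u):
    """psi(u) = (d0''(u), (d0*Q0)''(u)) for the standard Cauchy law."""
    d0_second = 2.0 * np.pi * np.cos(2.0 * np.pi * u)      # second derivative of d0
    d0q0_second = 2.0 * np.pi * np.sin(2.0 * np.pi * u)    # second derivative of d0*Q0
    return np.vstack([d0_second, d0q0_second])


def kernel(psi, m):
    """Quadratic form psi(u)^t M psi(u), evaluated at every grid point."""
    return np.einsum("iu,ij,ju->u", psi, m, psi)


A = np.diag([0.5, 0.5])                      # assumed Cauchy information matrix
A_inv = np.linalg.inv(A)
B_inv = np.linalg.inv(np.diag(np.diag(A)))   # B holds the diagonal of A

u = np.linspace(0.01, 0.99, 99)
psi = cauchy_psi(u)
k_det = kernel(psi, A_inv)                   # kernel behind the determinant criterion
k_are = kernel(psi, B_inv)                   # kernel behind the sum-of-ARE's criterion
k_var = kernel(psi, A_inv @ A_inv)           # kernel behind the sum-of-variances criterion
```

All three kernels are constant in u, so their cube roots normalize to the uniform density on [0,1], whose (k+1)-tiles are the equally spaced points $j/(k+1)$.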

3. ROBUST SPACING SELECTION

In this section we relax the assumption that $F_0$ is known
precisely and assume, instead, that $F_0$ is known only to belong
to a given finite set of probability laws, $\mathcal{L}$. The problem now
is to select spacings that are robust relative to $\mathcal{L}$.

Consider first the case of location parameter estimation.
For $L, G \in \mathcal{L}$ let $T(G) \in D_k$ denote the optimal spacing
for G and let $\mathrm{ARE}(\mu(T(G)) \mid L)$ be the ARE for T(G) when L
is the true underlying distribution. Chan and Rhodin (1980) suggest
choosing a spacing T(G*) that satisfies

    $\min_{L \in \mathcal{L}} \mathrm{ARE}(\mu(T(G^*)) \mid L) = \max_{G \in \mathcal{L}} \min_{L \in \mathcal{L}} \mathrm{ARE}(\mu(T(G)) \mid L).$    (3.1)

This solution provides a candidate for $F_0$, namely G*, and $\mu$ is
estimated accordingly. A spacing selected using (3.1) is an
element of $\{T(G);\ G \in \mathcal{L}\}$ which maximizes the guaranteed
asymptotic relative efficiency (GARE),
$\min_{L \in \mathcal{L}} \mathrm{ARE}(\mu(T(G)) \mid L)$, and is robust in
this sense of providing maximum GARE over the optimal spacings
for laws in $\mathcal{L}$. A disadvantage of this approach is that tedious
computations must be performed for each value of k. We now
present an asymptotic (as $k \to \infty$) alternative to (3.1) that
alleviates this difficulty.
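Once the efficiencies are available, the selection rule (3.1) is a max-min search over a table. A minimal sketch follows; it assumes the ARE's have already been computed and stored in a matrix whose entry (g, l) is the ARE of the spacing T(G_g) when L_l is the true law. The function name and the numbers in the usage note are hypothetical, not values from Chan and Rhodin (1980).

```python
import numpy as np


def maximin_spacing(are):
    """Pick the row (candidate law G) whose worst-case ARE over the
    columns (true laws L) is largest, as in a max-min rule.

    are[g, l] : ARE of the spacing optimal for law g when law l is true.
    Returns (index of G*, guaranteed ARE of T(G*)).
    """
    are = np.asarray(are, dtype=float)
    gare = are.min(axis=1)            # guaranteed ARE of each candidate spacing
    g_star = int(np.argmax(gare))     # candidate with the largest guarantee
    return g_star, float(gare[g_star])
```

With the illustrative table `[[1.00, 0.90, 0.80], [0.93, 1.00, 0.91], [0.85, 0.92, 1.00]]` the guarantees are 0.80, 0.91 and 0.85, so the second candidate is selected. The expensive part in practice is filling the table, which must be redone for each k.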

Let H denote a finite set of bounded piecewise continuous
density functions on [0,1] where, for $h \in H$, the set of points
where 1/h vanishes or is discontinuous is assumed to have content
zero and neither 0 nor 1 as an accumulation point. Also assume
that for each $L \in \mathcal{L}$ the corresponding density-quantile
function $d_L$ is in $C^2(0,1) \cap L^2[0,1]$ and monotone near 0 and 1.
It then follows from Pence and Smith (1981) that for any $h \in H$ and
$L \in \mathcal{L}$, if $\{T_k\}$ is RS(h) then

    $\lim_{k\to\infty} k^2 a_{11}(L) [1 - \mathrm{ARE}(\mu(T_k) \mid L)] = \frac{1}{12} \int_0^1 [d_L''(u)]^2 [h(u)]^{-2} du$    (3.2)

where $a_{11}(L)$ is the element that corresponds to $\mu$ in the
information matrix for L, $A_L$ say. Thus, what distinguishes between
the performance of spacing sequences generated by the densities
in H is, for fixed L, the asymptotic constant in (3.2). An
element of H that is optimal relative to $\mathcal{L}$ is therefore
provided by: Choose $h^* \in H$ to satisfy

    $\max_{L \in \mathcal{L}} \int_0^1 [d_L''(u)]^2 [h^*(u)]^{-2} du = \min_{h \in H} \max_{L \in \mathcal{L}} \int_0^1 [d_L''(u)]^2 [h(u)]^{-2} du.$    (3.3)

One can then estimate $\mu$ using, for instance, the law at which
$\int_0^1 [d_L''(u)]^2 [h^*(u)]^{-2} du$ is minimized. Alternative
conditions under which (3.3) is valid can be deduced from Theorem 4.4 of
Pence and Smith (1981) and Theorem 3.1 of Sacks and Ylvisaker (1968).
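The min-max rule (3.3) can be sketched numerically by evaluating the integral $\int_0^1 [d_L''(u)]^2 [h(u)]^{-2} du$ for every pair (h, L) and then picking the density with the smallest worst case. The example below is an illustration under stated assumptions: the two laws are the logistic ($d_L'' = -2$) and the Cauchy ($d_L'' = 2\pi\cos 2\pi u$), and the two candidate densities (the uniform and $1 + 0.5\cos 4\pi u$) are hypothetical choices bounded away from zero, not the optimal densities of Section 2.

```python
import numpy as np


def asymptotic_constants(d2_funcs, h_funcs, m=20_000):
    """consts[i, j] = integral over (0,1) of d2_funcs[j](u)^2 / h_funcs[i](u)^2,
    approximated by the midpoint rule on m cells."""
    u = (np.arange(m) + 0.5) / m
    return np.array([[np.mean(d2(u) ** 2 / h(u) ** 2) for d2 in d2_funcs]
                     for h in h_funcs])


def minimax_density(consts):
    """Index of the density whose largest constant over the laws is smallest."""
    worst = consts.max(axis=1)        # worst-case constant for each density
    best = int(np.argmin(worst))
    return best, float(worst[best])


# Second derivatives of the density-quantile functions (assumed laws).
laws = [lambda u: -2.0 * np.ones_like(u),                  # logistic
        lambda u: 2.0 * np.pi * np.cos(2.0 * np.pi * u)]   # Cauchy
# Hypothetical candidate densities on [0, 1].
densities = [lambda u: np.ones_like(u),
             lambda u: 1.0 + 0.5 * np.cos(4.0 * np.pi * u)]

consts = asymptotic_constants(laws, densities)
best, worst_case = minimax_density(consts)
```

Here the uniform density has worst-case constant $2\pi^2$ (attained at the Cauchy), and the second candidate does better in the worst case, so it is the one selected. Only one table of integrals is needed, independent of k, which is the computational advantage over (3.1).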

A logical choice for H would seem to be the set of optimal
densities for location parameter estimation corresponding to the
various laws in $\mathcal{L}$,
$\{\,|d_L''(u)|^{2/3} / \int_0^1 |d_L''(s)|^{2/3} ds\ ;\ L \in \mathcal{L}\}$
(see Eubank (1981a)). Choosing H in this manner we now compare spacings
selected using (3.3) to those obtained by Chan and Rhodin (1980)
using (3.1). As they restrict attention to laws that are
(essentially) members of the Tukey lambda family we shall do
likewise and, also for comparison purposes, take k = 5. Using
$L(\lambda)$ to denote that member of the Tukey lambda family having
shape parameter $\lambda$, a comparison of spacing GARE's for the two
procedures is provided in Table 1 for a few selected choices of
$\mathcal{L}$. It is important to note that a spacing obtained using (3.1)
is not a spacing that maximizes the GARE over all $T \in D_k$ (cf.
Chan and Rhodin (1980, p. 236)). Thus spacings obtained using
the asymptotic approach may, in fact, result in larger GARE's
as is illustrated by the cases of
$\mathcal{L} = \{L(-.1), L(0), L(.1), L(.14)\}$ and
$\mathcal{L} = \{L(-.6), L(-.5), L(-.4), L(-.3)\}$.


Table 1. GARE's for spacings selected using (3.1) and (3.3), k = 5

Set of laws                                      (3.1)    (3.3)
{L(0), L(.1), L(.14)}                            .9332    .9111
{L(-.1), L(0), L(.1), L(.14)}                    .9015    .9080
{L(-.6), L(-.5), L(-.4), L(-.3)}                 .9568    .9581
{L(-2.0), L(-1.8), L(-1.6), L(-1.4), L(-1.2)}    .9207    .9056
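For readers who wish to experiment with the Tukey lambda family, the density proportional to $|d_L''(u)|^{2/3}$ and its (k+1)-tiles can be sketched as follows. This is an illustration only: it assumes the parameterization $Q(u) = [u^\lambda - (1-u)^\lambda]/\lambda$, so that $d(u) = 1/[u^{\lambda-1} + (1-u)^{\lambda-1}]$, and it uses a crude midpoint quadrature that may be inaccurate when $d''$ is strongly singular at the endpoints. The function names are hypothetical and the sketch does not reproduce Table 1.

```python
import numpy as np


def tukey_lambda_d2(u, lam):
    """Second derivative of d(u) = 1/q(u), q(u) = u^(lam-1) + (1-u)^(lam-1)."""
    q = u ** (lam - 1) + (1 - u) ** (lam - 1)
    q1 = (lam - 1) * (u ** (lam - 2) - (1 - u) ** (lam - 2))               # q'
    q2 = (lam - 1) * (lam - 2) * (u ** (lam - 3) + (1 - u) ** (lam - 3))   # q''
    return -q2 / q ** 2 + 2.0 * q1 ** 2 / q ** 3


def location_spacing(lam, k, m=200_000):
    """(k+1)-tiles of the density proportional to |d''(u)|^(2/3)."""
    u = (np.arange(m) + 0.5) / m                  # midpoints avoid u = 0 and u = 1
    w = np.abs(tukey_lambda_d2(u, lam)) ** (2.0 / 3.0)
    cdf = (np.cumsum(w) - 0.5 * w) / np.sum(w)    # midpoint-rule distribution function
    return np.interp(np.arange(1, k + 1) / (k + 1), cdf, u)
```

At $\lambda = 0$ the formula for q reduces to the logistic case, where $d'' = -2$ and the spacing is uniform; for other symmetric members the spacing is symmetric about 1/2.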

Of course there is no reason to restrict attention to location
parameter estimation. A scale parameter version of (3.3) is
readily obtained by replacing $d_L$ with the product $d_L \cdot Q_L$ in the
previous discussion, where $Q_L$ is the quantile function for
$L \in \mathcal{L}$. For the estimation of both $\mu$ and $\sigma$ observe
that if $h \in H$ and $\{T_k\}$ is RS(h), then from the preceding comment
and (3.2)

    $\lim_{k\to\infty} k^2\,\mathrm{tr}[A_L - A_L(T_k)] B_L^{-1} = \frac{1}{12} \int_0^1 \{\psi_L(u)^t B_L^{-1} \psi_L(u)\} [h(u)]^{-2} du$    (3.4)

where $\psi$, A(T) and B have now been subscripted to indicate their
dependence on L. Thus one approach to simultaneous parameter
estimation can be based on (3.4). In this case H might be chosen
to consist of the optimal densities for $\mathrm{tr}\,A_L(T) B_L^{-1}$,
$L \in \mathcal{L}$, given in Section 2. Analogs of (3.4) can also be
obtained for the other criteria that have been considered.